NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Alignment Rationale for Query-Document Relevance

https://doi.org/10.1145/3477495.3531883

Kim, Youngwoo; Rahimi, Razieh; Allan, James (July 2022, Proceedings of The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 22))

Deep neural networks are widely used for text pair classification tasks such as as adhoc retrieval. These deep neural networks are not inherently interpretable and require additional efforts to get rationale behind their decisions. Existing explanation models are not yet capable of inducing alignments between the query terms and the document terms -- which part of the document rationales are responsible for which part of the query? In this paper, we study how the input perturbations can be used to infer or evaluate alignments between the query and document spans, which best explain the black-box ranker’s relevance prediction. We use different perturbation strategies and accordingly propose a set of metrics to evaluate the faithfulness of alignment rationales to the model. Our experiments show that defined metrics based on substitution-based perturbation are more successful in preferring higher-quality alignments, compared to the deletion-based metrics.
more » « less
Full Text Available
Query-driven Segment Selection for Ranking Long Documents

https://doi.org/10.1145/3459637.3482101

Kim, Youngwoo; Rahimi, Razieh; Bonab, Hamed; Allan, James (October 2021, Proceedings of The 30th ACM International Conference on Information and Knowledge Management (CIKM '21))

Transformer-based rankers have shown the state-of-the-art performance, but their self-attention operation is mostly unable to process long sequences. One of the common approaches to train these rankers is to heuristically select some segments of each document, such as the first segment, as training data. However, these segments may not contain the query-related parts of documents. To address this problem, we propose the query-driven segment selection from long documents to build training data for transformer-based rankers. The segment selector provides relevant samples with more accurate labels and non-relevant samples which are harder to be predicted. The experimental results show that the basic BERT-based ranker trained with the proposed segment selector significantly outperforms that trained by the heuristically selected segments, and performs equally to the state-of-the-art model with localized self-attention that can process longer input sequences. We also demonstrate that training with our segment selector, there is not much gain from feeding input sequences larger than 200 words. Our findings open up new opportunities to design efficient transformer-based rankers.
more » « less
Full Text Available
Accelerated Statistical Eye Diagram Estimation Method for Efficient Signal Integrity Analysis

https://doi.org/10.1109/ACCESS.2023.3336568

Park, Junyong; Kim, Youngwoo; Kim, Donghyun (January 2023, IEEE Access)
Explaining Text Matching on Neural Natural Language Inference

https://doi.org/10.1145/3418052

Kim, Youngwoo; Jang, Myungha; Allan, James (October 2020, ACM Transactions on Information Systems)
null (Ed.)
Natural language inference (NLI) is the task of detecting the existence of entailment or contradiction in a given sentence pair. Although NLI techniques could help numerous information retrieval tasks, most solutions for NLI are neural approaches whose lack of interpretability prohibits both straightforward integration and diagnosis for further improvement. We target the task of generating token-level explanations for NLI from a neural model. Many existing approaches for token-level explanation are either computationally costly or require additional annotations for training. In this article, we first introduce a novel method for training an explanation generator that does not require additional human labels. Instead, the explanation generator is trained with the objective of predicting how the model’s classification output will change when parts of the inputs are modified. Second, we propose to build an explanation generator in a multi-task learning setting along with the original NLI task so the explanation generator can utilize the model’s internal behavior. The experiment results suggest that the proposed explanation generator outperforms numerous strong baselines. In addition, our method does not require excessive additional computation at prediction time, which renders it an order of magnitude faster than the best-performing baseline.
more » « less
Full Text Available

Search for: All records